On the Relation between Statistical Properties of Spectrographic Masks and Recognition Accuracy

Author

  • J. F. Gemmeke
Abstract

Missing Data Techniques (MDT) can significantly improve the accuracy of automatic speech recognition (ASR) for speech corrupted by background noise. The increase in recognition accuracy obtained using MDT is largely dependent on the estimation of spectrographic masks used to distinguish speech from noise. We present an analysis technique which enables us to compare two mask estimation techniques. By contrasting a sound-class independent and a sound-class dependent distance measure, we show that we can directly relate differences between masks to their difference in recognition accuracy using the sound-class dependent distance measure. Experiments on AURORA2 using an oracle mask and an estimated mask show that modifying the estimated mask in order to reduce the statistical differences with the oracle mask leads to an increase in word recognition accuracy.
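The abstract does not define the two distance measures, so the following is only a minimal sketch of the general idea, assuming binary masks of shape (frequency bands, frames) and a simple disagreement rate as the distance. The function names, the per-frame sound-class labels, and the toy data are all hypothetical and are not taken from the paper.

```python
import numpy as np

def mask_distance(oracle, estimated):
    """Sound-class independent distance: fraction of cells where the masks disagree."""
    oracle = np.asarray(oracle, dtype=bool)
    estimated = np.asarray(estimated, dtype=bool)
    return float(np.mean(oracle != estimated))

def class_dependent_distance(oracle, estimated, frame_labels):
    """Same disagreement rate, computed separately per sound class.

    frame_labels holds one (hypothetical) class label per time frame.
    """
    oracle = np.asarray(oracle, dtype=bool)
    estimated = np.asarray(estimated, dtype=bool)
    frame_labels = np.asarray(frame_labels)
    distances = {}
    for label in np.unique(frame_labels):
        frames = frame_labels == label  # boolean selector over time frames
        distances[str(label)] = float(np.mean(oracle[:, frames] != estimated[:, frames]))
    return distances

# Toy data (assumed): 2 frequency bands, 4 frames.
oracle = np.array([[1, 1, 0, 0],
                   [1, 0, 0, 1]])
estimated = np.array([[1, 1, 1, 1],
                      [1, 0, 0, 1]])
labels = ["vowel", "vowel", "silence", "silence"]

overall = mask_distance(oracle, estimated)
per_class = class_dependent_distance(oracle, estimated, labels)
```

In this toy case the overall distance is a single number, while the per-class values show that all disagreement falls in the frames labelled as silence, which is the kind of difference a sound-class dependent measure can expose.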

Similar resources

A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition

Missing feature methods of noise compensation for speech recognition operate by first identifying components of a spectrographic representation of speech that are considered to be corrupt. Recognition is then performed either using only the remaining reliable components, or the corrupt components are reconstructed prior to recognition. These methods require a spectrographic mask which accuratel...

Smooth soft mel-spectrographic masks based on blind sparse source separation

This paper investigates the use of DUET, a recently proposed blind source separation method, as front-end for missing data speech recognition. Based on the attenuation and delay estimation in stereo signals soft time-frequency masks are designed to extract a target speaker from a mixture containing multiple speech sources. A postprocessing step is introduced in order to remove isolated mask poi...

Predicting the Use of Masks in the COVID-19 Based on the Systems Thinking, Personal - Social Responsibility, Moral Obligations and Individualism: An Approach of Consumer Behavior Theory

Since the outbreak of the COVID-19 epidemic in late December 2019, the World Health Organization and national health organizations around the world have issued recommendations for personal protection, the most prominent of which is the use of masks to prevent the spread of the virus. Despite the importance of this measure, many people still resist using masks. Therefore, this study, by emphasizin...

Calculating the amount of personal protection equipment’s (masks and gloves) and investigating Tehran's people knowledge about its management during the outbreak of COVID-19 (spring 2020)

Background and Objective: Changes in the quantity and quality of waste produced as a result of compliance with health protocols are the result of the COVID-19 outbreak. The present study aimed to determine the quantity of personal protection equipment produced in Tehran and people’s knowledge of its management. Materials and Methods: The present cross-sectional and descriptive-analytical study...

Identifying the contrast relation in Persian discourse using supervised learning methods

Discourse is the part of language that is used to communicate. A discourse relation recognition system can identify one or more relations between the textual units in a discourse. As in other languages, the contrast relation is one of the relations available in Persian discourse. Contrast relation recognition in discourse is useful for generation and perception of discourse, paraphrasing and ...

Publication date: 2007